Goto

Collaborating Authors

 hash function


e468a76212a58c1af94a3d235151944a-Supplemental-Conference.pdf

Neural Information Processing Systems

Reproducibility319 The backbone recommendation model, DLRM by Naumov et al. [2019], has an open-source PyTorch320 implementation available on Github which includes an implementation of CE. For CCE you need a321 fast library for K-means. We recommend the open-sourced implementation by Johnson et al. [2019]322 for better performance, but you can also use the implementation in Scikit-learn [Pedregosa et al.,323 2011]. The baseline result should be straightforward to reproduce as we closely follow the instructions324 provided by Naumov et al. [2019]. For the CE methods, we only need to change two functions in325 the code: create_emband apply_emb. We suggest using a class for each CE method; see Figure 3.326 For the random hash function, one could use a universal hash function or numpy.random.randint.327





Preliminaries

Neural Information Processing Systems

Given a T-periodic function f(x), its Fourier series representation is a decomposition into a sum of Fourier basis functions.



Optimizing affinity-based binary hashing using auxiliary coordinates

Neural Information Processing Systems

In supervised binary hashing, one wants to learn a function that maps a highdimensional feature vector to a vector of binary codes, for application to fast image retrieval. This typically results in a difficult optimization problem, nonconvex and nonsmooth, because of the discrete variables involved. Much work has simply relaxed the problem during training, solving a continuous optimization, and truncating the codes a posteriori. This gives reasonable results but is quite suboptimal. Recent work has tried to optimize the objective directly over the binary codes and achieved better results, but the hash function was still learned a posteriori, which remains suboptimal. We propose a general framework for learning hash functions using affinity-based loss functions that uses auxiliary coordinates. This closes the loop and optimizes jointly over the hash functions and the binary codes so that they gradually match each other. The resulting algorithm can be seen as an iterated version of the procedure of optimizing first over the codes and then learning the hash function. Compared to this, our optimization is guaranteed to obtain better hash functions while being not much slower, as demonstrated experimentally in various supervised datasets.



Practical Hash Functions for Similarity Estimation and Dimensionality Reduction

Neural Information Processing Systems

Hashing is a basic tool for dimensionality reduction employed in several aspects of machine learning. However, the perfomance analysis is often carried out under the abstract assumption that a truly random unit cost hash function is used, without concern for which concrete hash function is employed. The concrete hash function may work fine on sufficiently random input. The question is if it can be trusted in the real world when faced with more structured input. In this paper we focus on two prominent applications of hashing, namely similarity estimation with the one permutation hashing (OPH) scheme of Li et al. [NIPS'12] and feature hashing (FH) of Weinberger et al. [ICML'09], both of which have found numerous applications, i.e. in approximate near-neighbour search with LSH and large-scale classification with SVM.


#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Neural Information Processing Systems

Count-based exploration algorithms are known to perform near-optimally when used in conjunction with tabular reinforcement learning (RL) methods for solving small discrete Markov decision processes (MDPs). It is generally thought that count-based methods cannot be applied in high-dimensional state spaces, since most states will only occur once. Recent deep RL exploration strategies are able to deal with high-dimensional continuous state spaces through complex heuristics, often relying on optimism in the face of uncertainty or intrinsic motivation. In this work, we describe a surprising finding: a simple generalization of the classic count-based approach can reach near state-of-the-art performance on various high-dimensional and/or continuous deep RL benchmarks. States are mapped to hash codes, which allows to count their occurrences with a hash table.